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We formalize a combinatorial principle, called the 3X0 R principle, due to Feige, Kim 
and Ofck [FKO06] , as a family of unsatisfiablc propositional formulas for which refutations 
04 , of small size in any propositional proof system that possesses the feasible interpolation 

property imply an efficient deterministic refutation algorithm for random 3SAT with n 
variables and fi(n 14 ) clauses. Such small size refutations would improve the current best 

■ (with respect to the clause density) efficient refutation algorithm, which works only for 
I □(n 1 - 5 ) many clauses [FO07]. 

^ ■ We then study the proof complexity of the above formulas in weak extensions of cut- 

, ting planes and resolution. Specifically, we show that there are polynomial-size refutations 

of the 3XOR principle in resolution operating with disjunctions of quadratic equations 
(with small integer coefficients), denoted R(quad). We show that R(quad) is weakly au- 

■ tomatizable iff R(lin) is weakly automatizable, where R(lin) is similar to R(quad) but with 
00 '. linear instead of quadratic equations (introduced in [RT08]). This reduces the question 

of the existence of efficient deterministic refutation algorithms for random 3SAT with n 
variables and fl(n 1A ) clauses to the question of feasible interpolation of R(quad) and to 
the weak automatizability of R(lin) . 

in 
o 

1 Introduction 

In the well known random 3SAT model one usually considers a distribution on formulas in 
^ ■ conjunctive normal form (CNF) with m clauses and three literals each, where each clause is 

chosen independently with repetitions out of all possible 2 3 • Q) clauses with n variables. The 
clause density of such a 3CNF is m/n. When m is greater than cn for sufficiently large c, 
that is, when the clause density is greater than c, it is known (and easily proved for c > 5.2) 
that with high probability a random 3CNF is unsatisfiable. 

A refutation algorithm for random /cCNFs is an algorithm that receives a fcCNF (with c 
sufficiently large) and outputs either "unsatisfiable" or "don't know"; if the algorithm an- 
swers "unsatisfiable" then the fcCNF is required to be indeed unsatisfiable; moreover, the 
algorithm should output "unsatisfiable" with high probability (namely, with probability 
1 — o(n) over the input /cCNFs). 
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Refutation algorithms for random fcCNFs were investigated in Goerdt and Krivelevich 
[GK01] and subsequent works by Goerdt and Lanka [GL03], Friedman, Goerdt and Krivele- 
vich [FGK05], Feige and Ofek [FO07] and Feige [Fei07] (among other works). The best (with 
respect to the clause density) refutation algorithm to date works for random 3CNFs with at 
least 0(n L5 ) clauses [FO07]. On the other hand, Feige, Kim and Ofek [FKO06] considered 
efficient non- deterministic refutation algorithms (in other words, short witnesses for unsatis- 
fiability of 3CNFs that can be checked for correctness in polynomial-time). They established 
the current best (again, with respect to the clause density) efficient, alas non- deterministic, 
refutation procedure: they showed that with probability converging to 1 a random 3CNF 
with n variables and Q(n 1,4 ) clauses has a polynomial-size (in n) witness. It is an open prob- 
lem to devise a deterministic polynomial-time (or even a quasipolynomial-time) refutation 
algorithm for random 3CNFs with n variables and Q(n 1A ) clauses. 

In this work we reduce the problem of devising an efficient deterministic refutation algo- 
rithm for random 3CNFs with S7(n L4 ) clauses to a problem in propositional proof complexity, 
namely the feasible interpolation problem. Informally, the feasible interpolation property 
gives a way to transform short refutations of an unsatisfiable formula A(x,y) A B(x,z), for 
x,y,z pairwise disjoint sets of variables, into small circuits that given an input a, sepa- 
rate the cases where A(a,y) is satisfiable from the cases where B(a,z) is satisfiable (see 
Definition 1). Feasible interpolation was proposed in [Kra94] and developed further in 
[Raz95, BPR97, Kra97]. We present a family of unsatisfiable propositional formulas de- 
noted Y n expressing a combinatorial principle whose short refutations in any propositional 
proof system that possesses the feasible interpolation property imply an efficient deterministic 
refutation algorithm for random 3CNFs with fi(n ) clauses: 

Theorem 1 If there exists a propositional proof system that has the feasible interpolation 
property and that admits uniform polynomial- size refutations ofT n , then there is a determin- 
istic polynomial-time refutation algorithm for random 3 CNF formulas with fi(n L4 ) clauses. 1 

This implies that determinizing the Feige et al. polynomial-time non-deterministic algo- 
rithm reduces to establishing the feasible interpolation property for those propositional proof 
systems that admit short refutations of the corresponding formulas T n (or to the problem 
of demonstrating polynomial-size proofs of T n in a proof system whose feasible interpolation 
property is already established). 

The argument is based on the following simple observation: we observe that the com- 
putationally hard part of the Feige, Kim and Ofek non-deterministic refutation algorithm 
(namely, the part we do not know how to efficiently compute deterministically) corresponds 
to a disjoint NP-pair (formally, it is a family of such pairs). The pair of disjoint NP sets 
are, first, the set of 3CNFs that have a certain combinatorial property, namely they contain 
a collection of sufficiently many inconsistent even k-tuples, as defined by Feige et al., and 
second, the set of 3CNFs with m clauses for which there exists an assignment that satisfies 
more than m — £ clauses as 3X0 Rs (for I a certain function of the number of variables n). 

It is worth noting that any efficient refutation algorithm (deterministic or not) corresponds 
to a disjoint NP-pair, via a sort of reflection principle. The reason is as follows: any refutation 

lr This theorem also holds if we consider feasible interpolation to be in quasipolynomial-size (which would 
then entail a corresponding quasipolynomial time refutation algorithm). 
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algorithm is based on some property P of CNFs that can be witnessed (or better, found) 
in polynomial-time. Thus, any refutation algorithm corresponds to a family of formulas 
P(x) — t--iSAT(x), expressing that if the input CNF has property P then x is unsatisfiable, 
and so P{x) and SAT(x) are two disjoint NP predicates. However, the disjoint NP-pair we 
work with is not of this type. That is, P{x) will not be the predicate for the full Feige, Kim 
and Ofek witnesses, rather a specific (combinatorial) predicate, that is only one ingredient in 
the definition of the Feige et al. witness. This saves us the trouble to formalize and prove in a 
weak prepositional proof system the full Feige et al. argument (such a full formalization was 
done inside TC°-Frege in [MT12]). For more information on the relation between disjoint 
NP-pairs and propositional proof complexity see for example [Pud03, AB04] . 

In the second part of this paper, we study the above suggested approach for improving 
the current known refutation algorithms, by considering short refutations for T n in seemingly 
weak proof systems that extend both the cutting planes system (cf. [CCT87, Pud97]) and 
Res(2) (the system Res(fc) is an extension of resolution that operates with fcDNFs instead 
of clauses, introduced by Krajfcek [KraOl]). Note that the cutting planes proof system itself 
has feasible interpolation [BPR97, Kra97, Pud97]. But on the other hand, it is unknown 
whether Fles(2) has feasible interpolation, while it is known by Atserias and Bonet [AB04] 
that Res(2) has feasible interpolation iff resolution is weakly automatizable (see below on 
weak automatizability) . 

More precisely, a family of R(quad) refutations is a family of refutations operating with 
disjunctions of quadratic equations, where each quadratic equation is of the form: 

^ ^ CijXiXj ~\- ^ ^ C-%X% ~\~ Cq — Qj 
i,j£[n] i£[n] 

such that Ci,Cij and a are integers written in unary representation. We have the following 
(generalized) resolution rule: from two disjunctions of quadratic equations \J i L i \/ L = a and 
Vj Lj V L' = b we can derive: 

\J Li V \J Lj V (L - L' = a — b). 

i j 

We also need to add axioms that will force our variables to be 0, 1 (see Section 5.1). 
We show the following: 

Theorem 2 R(quad) admits polynomial- size refutations of the 3XOR principle T n . 

This means that proving that R(quad) has the feasible interpolation (in polynomial-size 
or quasi-polynomial-size) would entail a deterministic (polynomial-time or quasipolynomial- 
time, respectively) refutation algorithm for random 3CNFs with Q(n ) clauses. 

Further, note that [MT12] proved the 3XOR principle in TC°-Frege, which we might 
conjecture to be a stronger proof system than R(quad). Thus, Theorem 2 gives a possibly 
tighter logical characterization of the 3XOR principle than [MT12]. On the other hand, 
[MT12] gave a polynomial-size proof for the correctness of the full Feige at al. witnesses (and 
not only the 3XOR principle). 

To describe our next result we recall the concept of automatizability which is central 
to proof-search algorithms. Given any propositional proof system, an important question 
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is whether one can find efficiently proofs in the system (regardless of their sizes). For this 
purpose the concept of automatizability was introduced by Bonet et al. [BPROO]: we say 
that a refutation system V (equivalently, a proof-system) is automatizable if there exists a 
polynomial-time algorithm that on input (r, l m ), where r is an unsatisfiable formula and 
l m is a string of m ones, the algorithm outputs a "P-refutation of r of size at most m in 
case such a refutation exists. Following Atserias and Bonet [AB04], we say that a refutation 
system V is weakly automatizable if there exists an automatizable refutation system V' that 
polynomially simulates V . Note that if V is not automatizable, it does not imply that V 1 is not 
automatizable. Hence, from the perspective of proof-search algorithms, weak automatizability 
is a more natural notion than automatizability (see [Pud03] on this). 

In [RT08], the system R(lin) was introduced which is similar to R(quad) above, except 
that all equations are linear instead of quadratic. Using similar arguments to Pudlak [Pud03], 
we show the following: 

Theorem 3 R(quad) is weakly automatizable iff R(lin) is weakly automatizable. 

Since weak automatizability of a proof system implies also that the proof system has feasible 
interpolation, we obtain the following reduction: 

Corollary 4 // R(lin) is weakly automatizable then there is a deterministic refutation algo- 
rithm for random 3CNFs with £l(n 1A ) clauses. 

1.0.1 Discussion 

The key point of this work is the reduction from constructing an efficient refutation algorithm 
for the clause density J7(n ' 4 ) to proving upper bounds in weak enough propositional proof 
systems for the 3XOR principle. On the one hand, this might be construed as hinting on a 
new way to improve current refutation algorithms. Note that even establishing a quasipoly- 
nomial interpolation for proof systems that admit short proofs of the 3XOR principle would 
be very interesting since the best known refutation algorithm for f2(n 1,4 ) clauses works in 
time 2 0( - n lo § n ) [FKO06]. However, it is probably safer to consider this reduction as empha- 
sizing the important consequences that establishing the feasible interpolation property for 
Res(2) and its (weak) extensions would have. For more recent work on the algorithmic im- 
plications of establishing feasible interpolation and automatizability for related propositional 
proof systems see [AM12, HP11]. 

The rationale behind choosing the R(quad) refutation system is that it seems to be a 
possibly weak propositional proof system that, loosely speaking, can both count and com- 
pose mappings, as we now explain. The cutting planes proof system can count to a certain 
extent; however, it is unknown whether cutting planes has short refutations of the Tseitin 
graph formulas [Tse68] (which are a kind of counting contradictions). By extending cutting 
planes (with small coefficients) to allow disjunctions of linear equations (instead of a single 
inequality), we obtain the system R(lin) (this is similar to R(quad) but with linear instead 
of quadratic equations). It was shown in [RT08] that even when we allow disjunctions of 
only constant many generalized 2 linear equations in every proof-line, R(lin) has short refuta- 

2 A generalized equation is an equation L £ S, for S C Z; which can be considered also as the disjunction 
Vses L = s - 
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tions of the Tseitin formulas; this shows that using (fairly restricted) disjunctions of linear 
equations allows to improve the counting abilities of cutting planes. 

The reason we need to use quadratic instead of linear equations is to be able to compose 
maps. As observed by Pudlak [Pud03], the reason why the k-Clique and (k — 1) -Coloring 
contradictions do not have short cutting planes refutations is that cutting planes cannot 
compose two mappings, which then makes it impossible to perform a routine reduction from 
the k-Clique and (k — 1)- Coloring contradiction to the pigeonhole principle contradiction (and 
the latter contradiction does admit short cutting planes refutations). This is why Pudlak 
introduced in [Pud03] the system CP 2 which is cutting planes operating with quadratic 
equations. The system R(quad) we work with is an extension of CP 2 (when the latter is 
restricted to small coefficients). 

2 Preliminaries 

Let F be a 3CNF with n variables X = {x±, . . . , x n } and m clauses. We denote {1, . . . , n} 
by [n]. The truth value of a formula G under the Boolean assignment A is written G(A). An 
assignment A satisfies as a 3XOR the clause ^Vf 2 V 4 if {i\ £ 2 © = 1 (where 

denotes the XOR operation, and the £iS are literals, namely variables or their negation). 

2.1 Disjoint NP-pairs and feasible interpolation of propositional proofs 

In this section we review the notion of a disjoint NP-pair and its relation to propositional 
proofs and the feasible interpolation property. 

A disjoint NP-pair is simply a pair of languages in NP that are disjoint. Let L, N be 
a disjoint NP-pair such that R(x,y) is the corresponding relation for L and Q(x,z) is the 
corresponding relation for N; namely, R(x, y) and Q(x, z) are polynomial-time relations such 
that x £ L iff 3y, \y\ = poly(|x|) A R(x, y) = true and x € N iff 3z, |z| = poly(|x|) A Q(x, z) = 
true. Then, for every n £ N, there exists an unsatisfiable CNF formula in three mutually 
disjoint sets of variables x, y,z: 

F n := A(x,y) AB(x,z), (1) 

where A(x,y) and B(x, z) are the CNF formulas expressing that R(x, y) and Q(x, z) are true 
for x of length n, respectively. 

More formally, A(x,y) is a CNF in the variables x = (x\, . . . , x n ) and y = (yi, ■ ■ ■ ,ye), 
that is true iff R(x,y) is true, and B(x,z) is a CNF in the variables x and z = (zi, . . . , z m ), 
that is true iff Q(x, z) is true (for some £, m that are polynomial in n). Since both polynomial- 
time relations R(x, y) and Q(x, z) can be converted into a family of polynomial-size Boolean 
circuits, and thus can be written as a family of polynomial-size CNF formulas (by adding 
extension variables, that we may assume are incorporated in the certificates y and z), both 
the formulas A(x,y) and B(x,z) can be written as polynomial-size CNF formulas. 

Note that because y and z are disjoint sets of variables and A(x,y)AB(x,z) is unsatisfiable, 
it must be that given any x £ {0, l} n , either A(x, y) or B(x,z) is unsatisfiable (or both). 

Feasible Interpolation. We use standard notation and notions from the theory of proposi- 
tional proof complexity (see [BP98, Seg07, CK02, Kra95] for surveys and introductions to the 
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field). In particular, we sometimes mix between refutations (that is, proofs of unsatisfiability 
of a formula) and proofs (that is, a proof of the negation of an unsatisfiable formula, which 
must be a tautology). From the perspective of proof complexity refutations of contradictions 
and proofs of tautologies are mostly the same. 

Definition 1 (Interpolation property) A propositional proof system V is said to have 
the interpolation property in size f(n) if the existence of a V -refutation of A(x,y) A B(x,z) 
as in (1) of size s implies the existence of (usually a Boolean) circuit C(x) of size f(s), called 
the interpolant of A(x,y) A B(x,~z), such that 



In other words, if only A(a,y) is unsatisfiable then C(a) = 1 and if only B(a,~z) is unsatis- 
fiable then C(a) = 0, and if both A(a,y) and B(a,~z) are unsatisfiable then C(a) can output 
either or 1. When a proof system V has the interpolation property in size poly(n) we say 
that V has the feasible interpolation property, or simply that V has feasible interpolation. 

// the V -refutations of the family of formulas F n (in (I) above) are uniform, in the sense 
that there is a polynomial-time algorithm that can construct the refutations of F n given n in 
unary representation, then the interpolant circuit C of F n above is assumed to be uniform 
as well (and so ifV has feasible interpolation then there is a polynomial-time algorithm that 
behaves as the circuit C in (2)). 

Note that L (as defined above) is precisely the set of those assignments a for which A(a, y) 
is satisfiable, and N is precisely the set of those assignments a for which B(a,z) is satisfiable, 
and L and ./V are disjoint by assumption, and so C(x) separates L from N; namely, it outputs 
different values for those elements in L and those elements in N. 

It is already known that quite a few propositional proof systems posses the feasible in- 
terpolation property. Such systems include for instance resolution [BPR97, Pud97, Kra97], 
cutting planes [IPU94, BPR97, Pud97, HC99] and the polynomial calculus [CEI96]. On the 
other hand it is known (conditioned on hardness assumptions from complexity theory) that 
several other proof systems do not have the feasible interpolation property; these include 
threshold logic (namely, TC°-Frcgc) [BPR00] and bounded depth Frege [BDG+99] 



The following definitions and proposition is due to Feige et al. [FKO06]. 

Definition 2 (Inconsistent even fc-tuple) An even /c-tuple is a tuple ofk many 3-clauses 
in which every variable appears even times. An inconsistent even fe-tuple is an even k-tuple 
in which the total number of negative literals is odd. 

Note that for any even /c-tuple, k must be an even number (since by assumption the total 
number of variables occurrences 3k is even). The following is the combinatorial principle, 
due to Feige et al. [FKO06] that we consider in this work: 



C{a) = 1 
C(a) = 



A(a,y) is unsatisfiable; and 
B(a,~z) is unsatisfiable. 




3 The 3XOR principle 
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The 3XOR principle: Let K be a 3CNF over the variables X. Let S be t inconsistent 
even fc-tuples from K, such that every clause from K appears in at most d inconsistent even 
/c-tuples in 5. Then, given any Boolean assignment to the variables X, the number of clauses 
in K that are unsatisfied by the assignment as 3XOR is at least \t/d~\. 

The correctness of the 3XOR principle follows directly from the following proposition: 

Proposition 5 ([FKO06]) For any inconsistent even k-tuple (over the variables X) and 
any Boolean assignment A to X, there must be a clause in the k-tuple that is unsatisfied as 
3X0 R. 

Proof: Assume by a way of contradiction that for some assignment A every clause from the 
A;- tuple is satisfied as a 3XOR. Remember that k must be even. Thus, if we sum modulo 2 
all the literals in the fc-tuple via clauses, then since k is even we get that the sum equals 
modulo 2. 

On the other hand, if we count via literals then summing modulo 2 all literals £i(A) in 
the fe-tuple, we get 1 (modulo 2), for the following reason. First, we sum all variables Xi that 
have odd number of negative occurrences. Because Xi appears an even number of times in 
the fc-tuple, the number of positive occurrences of xi is also odd. So in total all occurrences 
of Xi(A) and —>Xi(A) contribute 1 to our sum (modulo 2). There must be an odd number of 
such variables xi in our fc-tuple because the £;-tuple is inconsistent. Thus this sums up to 1 
(modulo 2). Then we add to this sum those variables that have an even number of negative 
occurrences (and hence also an even number of positive occurrences); but they cancel out 
when summing their values under A modulo 2, and so they contribute to the total sum. 
Hence, we get 1 as the total sum. This contradicts the counting in the previous paragraph 
which turned out 0. ■ 

Note that the 3XOR principle stems directly from the Proposition 5 and the fact that 
every clause in K appears in at most d even £>tuples in S. 



4 From short proofs to refutation algorithms 

In this section we demonstrate that polynomial-size proofs of (an encoding of the) 3XOR 
principle in a proof system that has the feasible interpolation property yield deterministic 
polynomial-time refutation algorithms for random 3CNF formulas with Q,(n 1A ) clauses. 

4.1 The witness for unsatisfiability 

Feige, Kim and Ofek nondeterministic refutation algorithm [FKO06] is based on the existence 
of a polynomial-size witness of unsatisfiability for most 3CNF formulas with sufficiently large 
clause to variable ratio. The witness has several parts, but as already observed in [FKO06], 
apart from the t inconsistent even fc-tuples (Definition 2), all the other parts of the witness 
are known to be computable in polynomial-time. In what follows we define the witnesses for 
unsatisfiability. 

Let K be a 3CNF with n variables x\, . . . ,x n and m clauses. The imbalance of a variable 
Xi is the absolute value of the difference between the number of its positive occurrences and 
the number of its negative occurrences. The imbalance of K is the sum over the imbalances 
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of all variables, denoted I(K). We define M{K) to be an n x n rational matrix such that 
Mij equals \ times the number of clauses in K where X{ and xj appear with different signs 
minus | times the number of clauses where they appear with the same sign. In other words, 
for each clause in K in which x\ and xj appear with the same sign we add ^ and for each 
clause in K in which and Xj appear with different signs we subtract ^. Let A be a rational 
approximation of the biggest eigenvalue of M{K). (It is enough to use polynomially small 
rational approximations; this was already noted in [FKO06]; see also [MT12] for a detailed 
treatment of rational approximations in this context.) 

Definition 3 (FKO witness) Given a 3CNF K, the FKO witness for the unsatisfiability 
of K is defined to be the following collection: 

1. the imbalance I(K); 

2. the matrix M{K) and the (polynomially small) rational approximation A of its largest 
eigenvalue; 

3. a collection S consisting of t < n 2 inconsistent even k-tuples such that every clause in 
K appears in at most d many even k-tuples; 

4- the inequality t > d '( I ( K )+ Xn ) _j_ a ^ must hold. 

(The notation o(l) above stands for a specific rational number b/n c , for c a constant and b a 
positive integer.) 

We have the following: 

Theorem 6 ([FKO06]) For a random 3CNF K with n variables and Q,(n 1A ) clauses, with 
probability converging to 1 as n tends to infinity there exist natural numbers k = 0(n°' 2 ), t = 
f2(n ) (where t < n 2 ) and d = 0{k) = 0(n°' 2 ), such that K has a witness for unsatisfiability 
as in Definition 3. 

It can be shown that given a 3CNF K and the parameter k, we can compute in polynomial 
time the values of t and d, for which the theorem holds. 

Inspecting the argument in [FKO06] , one can see that it is sufficient to replace part 3 in 
the witness with a witness for the following: 

3'. No assignment can satisfy more than m — \tjd\ clauses in K as 3XORs. 

Therefore, since I(K), M{K) and A are all polynomial-time computable (see [FKO06] 
for this), in order to determinize the non-deterministic refutation algorithm of [FKO06] it is 
sufficient to provide an algorithm that almost surely determines (correctly) that part 3' above 
holds. In other words, in order to construct an efficient refutation algorithm for random 
3CNFs (with Q(n 1A ) clauses) it is sufficient to have a deterministic algorithm A that on 
every input 3CNF answers either "condition 3' is correct" or "don't know", such that 
A is never wrong (i.e., if it says "condition 3' is correct" then condition 3' holds) and 
with probability 1 — o(n) over the input 3CNFs most inputs A answers "condition 3' is 
correct". Note that we do not need to actually find the Feige et al. witness nor do we need 
to decide if it exists or not. The relation between unsatisfiability and bounding the number 
of clauses that can be satisfied as 3XOR in a 3CNF was introduced by Feige in [Fei02] (and 
used in [FO07] as well as in [FKO06]). 
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4.2 The disjoint NP-pair corresponding to the 3XOR principle 



For every choice of t, d and k functions of n (that are polynomially in n computable; we 
assume also that t(n) < n 2 ), we define the corresponding 3X0R principle disjoint NP-pair 
as the pair (L,N), where: 

L := {x | x is a 3CNF with n variables and m clauses for which 

there exists t(n) inconsistent even fc(n)-tuples, and 

every clause of x appears in at most d(n) fc(n)-tuples}, 

N : = jx | x is a 3CNF with n variables and m clauses for which 

there exists an assignment that satisfies at least 

m — (I(x) + Xn)/2 clauses as 3XOR}, 

where I(x) in the definition of TV" is the imbalance of x and A is the (polynomially small) 
rational approximation of the largest eigenvalue of M(x) as described in Section 4.1. 

Formally, for every choice of t, k, d as functions of n there is a different corresponding 
disjoint NP-pair. However, for the sake convenience we do not specify explicitly the chosen 
t, k, d in the notation L, N as it will be clear from the context which t, k, d we use. It is easy 
to verify that both L and N are indeed NP sets for every choice of t < n 2 , d, k, m, and that, 
by the 3XOR Principle, L n N = 0. 

Using the same notation as in Section 2.1, we denote by R(x,y) and Q(x,z) the 
polynomial-time relations for L and N, respectively. Further, for every n € N, there ex- 
ists an unsatisfiable CNF formula in three mutually disjoint sets of variables x,y,z: 

T n :=A(x,y)AB(x,z), (3) 

where A(x,y) and B(x,~z) are the CNF formulas expressing that R(x, y) and Q(x, z) are true 
for x of length n, respectively. 

Corollary 7 If there exists a propositional proof system that possesses the feasible interpo- 
lation property and that admits uniform polynomial- size refutations of T n , then there is a 
deterministic polynomial-time refutation algorithm for random 3 CNF formulas with Q,(n 1A ) 
clauses. 

Note: For the consequence in the corollary to hold it is enough to assume that there are 
polynomial-size refutations of T n where the parameters are set as follows: k = 0(n°' 2 ), 
t = il(n L4 ) (and also t < n 2 ), d = O(k) = 0(n 0,2 ); however, for the short proofs in the next 
section we only need to assume that t is polynomial in n. 

Proof: By assumption, and by the definition of the feasible interpolation property, for every 
choice of functions t, d, k of n (that are also polynomially in n computable) there exists a 
deterministic polynomial-time interpolant algorithm A that on input a 3CNF K, if A(ET) = 1 
then K L and if A(K) = then K N. 

The desired refutation algorithm then works as follows: it receives the 3CNF K and for 
all polynomially many choices of t < n 2 , d, k it runs A(-RT). If for one of these runs A(if) = 
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then we know that K ^ N, and we answer "unsatisf iable" . Otherwise, we answer "don't 
know" . 

The correctness of this algorithm stems from the following two points: 

(i) If we answered "unsatisf iable", K is indeed unsatisfiable, since K £" N, and by Section 
4.1 this means that K is unsatisfiable. 

(ii) For almost all 3CNFs we will answer "unsatisfiable" because almost all of them will 
have an FKO witness (by Theorem 6), which means that K £ L for some choice of t < n 2 ,d,k 
and hence the interpolant algorithm A must output in at least one of these cases (because 
k{K) = 1 means that K £ L). m 



5 Propositional formula encoding the 3XOR principle 

In this section we define the propositional refutation system in which we are going to demon- 
strate polynomial-size refutations of the 3XOR principle. We then give an explicit encoding 
of the 3XOR principle as an unsatisfiable set of disjunctions of linear equations. 

5.1 The propositional refutation systems R(lin) and R(quad) 

The refutation system in which we shall prove the unsatisfiability of the 3XOR principle is 
denoted R(quad). It is an extension of the refutation system R(lin) introduced in [RT08]. 
The system R(lin) operates with disjunctions of linear equations with integer coefficients and 
R(quad) operates with disjunctions of quadratic equations with integer coefficients, where 
in both cases the coefficients are written in unary representation. We also add axioms that 
force all variables to be 0, 1. First we define the refutation system R(lin). 

The size of a linear equation a\X\ + ... -\-a n x n + a n+ \ = a® is defined to be Y17=o \ai\, that 
is, the sum of the bit sizes of all en written in unary notation. The size of a disjunction of 
linear equations is the total size of all linear equations in it. The size of a quadratic equation 
and of a disjunction of quadratic equations is defined in a similar manner (now counting the 
size of the constant coefficients, the coefficients of the linear terms and the coefficients of the 
quadratic terms). The empty disjunction is unsatisfiable and stands for the truth value false. 

Notation: For L a linear or quadratic sum and S C Z, we write L £ S, to denote the 
disjunction \J seS L = s. We call L £ S a generalized linear (or quadratic) equation. 

Definition 4 (R(lin)) Let K := {K\, . . . , K m } be a collection of disjunctions of linear equa- 
tions in the variables x±, . . . , x n . An R(lin)-proof from K of a disjunction of linear equations 
D is a finite sequence tt = (Di, ...,Di) of disjunctions of linear equations, such that Di = D 
and for every i £ [£] one of the following holds: 

1. Di = Kj, for some j € [to]; 

2. Di is a Boolean axiom xt £ {0, 1}, for some t £ [n]; 

3. Di was deduced by one of the following R(lin) -inference rules, using Dj,Dk for some 
j,k < i: 
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Resolution Let A, B be two, possibly empty, disjunctions of linear equations and let 
Li, L2 be two linear equations. From A V L\ and B V L2 derive A V B V {L\ — L2). 
(We assume that every linear form with n variables is written as a sum of at most 
n + 1 monomials. 3 ) 

Weakening From a possibly empty disjunction of linear equations A derive A\I L , 
where L is an arbitrary linear equation over the variables xi, . . . , x n . 

Simplification From A V (0 = k) derive A, where A is a possibly empty disjunction 
of linear equations and k 7^ 0. 

An R(lin) refutation of a collection of disjunctions of linear equations K is a proof of the 
empty disjunction from K. The size of an R(lin) proof it is the total size of all the disjunctions 
of linear equations in tt (where coefficients are written in unary representation). 

Definition 5 (R(quad)) The system R(quad) is similar to R(lin) except that proof-lines can 
be disjunctions of quadratic equations with integer coefficients ^ • CijXiXj + J2i °i x i + c = S 
instead of linear equations; and the Boolean axioms are now defined for all i,j G [n], as 
follows: 

Xi G {0, 1}, Xi + xj - XiXj G {0, 1}, Xi - XiXj G {0, 1} . 

The size of an R(quad) refutation is the total size of all the proof-lines in it. 

Both R(lin) and R(quad) can be proved to be sound and complete (for their respective 
languages, namely, disjunctions of linear and quadratic equations, respectively) refutation 
systems. 

5.2 The formula 

We now describe the formula T n encoding the 3XOR principle (the formula depends also on 
the parameters t,m and k, but we shall suppress these subscripts). The formula will have 
the correct form for application of the feasible interpolation property, namely it will be an 
unsatisfiable set of formulas consisting of two groups: the first will be the formulas in the 
X, Y variables only and the second will be the formulas in the X, Z variables only. We also 
have variables that encode a product of two variables, namely, (extension) variables for which 
a certain formula in one of the groups forces them to behave like products of two variables 
from X, Y, Z. Since we cannot use the Y variables in the second part of the formula and we 
cannot use the Z variables in the first part of the formula, we can encode only products of 
variables from X, Y and from X, Z, but not products of a Y variable with a Z variable. 

It will be convenient sometimes to denote by Xi +n the literal -iXj, when it is assumed we 
use the n variables xi, ...,x n in the 3CNF encoded by X. 

Variables and their meaning. The variables X correspond to the input 3CNF with n 
variables. The variables Y correspond to the collections of t many inconsistent even £;-tuples. 
The Z = {zi, z n } variables stand for a Boolean assignment for the n variables of the 3CNF. 

3 Accordingly, in R(quad) we assume that every quadratic sum with n variables is written as a sum of at 
most 1 + 2n + Q) monomials. 
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(Note that we use the variables Xi for the variables in the 3CNF and the variables Xij for the 
variables in our encoding of the 3CNF.) 

The input 3CNF X is encoded as a 3m x 2n table X, where each block of three rows 
corresponds to a clause and columns from 1 to n correspond to positive literal occurrences 
and columns n + 1 to 2n correspond to negative literal occurrence. Formally, let 1 < i = 
3 ■ I + r < 3m, where r G {0, 1, 2}, I 6 [n], and let j G [2n]. Then Xij = 1 means that the rth 
literal in the Ith clause in the input 3CNF is: 

Xj if j < n, and ~^Xj_ n , if j > n. 

The collection of t inconsistent k even tuples is encoded as t tables, each table is encoded 

( s \ ( s \ 

by the variables Y , for s G [t]. Each Y represents a table of dimension k x m, where 

(s) 

y)jl = 1 iff the jth member in the sth A:-tuple is the Zth clause (meaning the Ith clause in the 
input 3CNF encoded by X). 

Group I of formulas (containing only X,Y): 

1. Every row in X contains exactly one 1: 

2n 

^^Xjj = 1, for every i £ [3m] . 

— (s) 

2. Every row in Y contains exactly one 1: 

m 

X>i S) = 1 ' for all s € [t],i e [k]. 
i=i 

3. Every column in Y contains at most one 1: 

k 

G {0, 1}, for alls G [t],j€ [m]. 

i=l 

4. For any j G [k],r G [m],s G [t],£ G [3m], i G [2n], we introduce the new single formal 
variable -x^j\ which will stand for the product of two other formal variables y^} -xa . 
For this we shall have the following axioms: 

v$-[v$-xiile {0,1} and ^-[yj?-x«]€{0,l} 

and 

v$ + x« - [yj? ■ s«] g {o, 1} 

As an abbreviation (not a formal variable) we define the following: 

m 

Qifh := ^2lyjr • x (3(r-l)+h)il > for a11 * = t 2n ] and /i G {0, 1, 2} and s G [i], 

r=l 

— (s) 

which expresses that Xj occurs as the /ith literal in the jth clause of Y 
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5. We express that all the Y 's are even fc-tuples (that is, that every variable Xi appears 
even times) by: 

E Qirh + Q^+n)rh G {°> 2 > 4 > - fc }> for a11 ^ [n], 8 G [*]. 

re[fc],/i=0,l,2 

We can assume that is even, since for every even fc-tuple fc must be even. 

/ s \ 

6. Similarly, we encode that the Y 's are inconsistent (that is, the number of negative 
literals in them is odd) by: 

E Q(in M e{0,3,5,...,/c-l}. 

re[fc],h=0,l,2 
t6[n] 

7. Every clause i G [m] appears in at most d even fe-tuples Y 1 , Y"^ . We put: 

E ^j'i G ^ or ever y * e 

ie[fc],se[t] 



This finishes the encoding of the t inconsistent even fe-tuples. 

Group II of formulas (containing only X, Z): We now turn to the formulas expressing 
that there are assignments Z that satisfy more than to — \t/d~\ clauses in X as 3X0 Rs. For 
every j E [3m], i G [2n],£ G [n], let \xji ■ zi} be a new formal variable that stands for the 
product Xji ■ zg. As in part 4 of the formula above, we include the axioms that force \xji ■ zgj 
to stand for Xji ■ zg. 

Let us use the following abbreviation: 

(n n \ 

E [ z (3(i-l)+fc)i • Z i] + E ( X (3(j-l)+h)(i+n) - [3(3Cj-l)+/0(t+n) ' «i] ) J • 
t=l i=l / 

Then, £/j G {1,3} states that the jth. clause in X is satisfied as 3X0R by Z. Note that 
2;(3(j_i) + / l )(j +n ) — [x(3(j_i) + M(i +n ) • Zj] is a linear term that expresses the quadratic term 
X(3(j-i)+h)(i+n) ■ (1 - 

8. Let ttj be a new formal variable expressing that the jth clause in X is satisfied as 3X0R 
by Z. Hence, Uj G {1,3} iff Uj = 1, and we encode it as: 

Uj G {0, 2} V (uj = 1) and Uj G {1, 3} V (uj = 0), 

9. There are assignments Z that satisfy more than m — \t/d~\ clauses in X as 3X0 Rs: 

m 

Uj G {m — \t/d\ + 1, to}. 

3=1 

The set of formulas described in this section has no 0, 1 solution by virtue of the 3XOR 
principle itself (Section 4.1). 
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6 Short refutations for the 3XOR principle 



In this section we demonstrate polynomial-size (in n) R(quad) refutations of the 3X0R 
principle as encoded by disjunctions of linear equations in the previous section. We sometimes 
give only a high level description of the derivations. We use the terminology and abbreviations 
in Section 5. We also use freely the ability of R(lin) (and hence R(quad)) to count. For a 
detailed treatment of efficient counting arguments inside R(lin) see [RT08] . 

Step 1: Working in R(quad), we first show that our axioms prove that Z cannot satisfy as 

3X0 R all clauses of Y , for any s £ [t]. 
Recall from Section 5 the abbreviation 

m 

Qijh ■= • Z(3(r-i)+ft)i] , for all i = [2n] and h G {0, 1, 2} and s G [t], 

r=l 

( s \ 

which stands for the statement that x% occurs as the hth. literal in the jth clause of Y (and 
where xi for % > n stands for the literal —iXi- n ). Let us use the abbreviation: 

n n 
i=l i=l 

Then, Pjhs is a quadratic sum that stands for the statement that the /ith literal in clause j 
in Y^ is true under Z. Thus, 

PjOs + PjU + Pj2s G {1, 3}, for all j G [k] (4) 

expresses that all the clauses in Y are satisfied as 3XOR under Z. 

Our goal now is to refute (4), based on our axioms. Informally, this refutation is done 

i s \ 

by counting: first count by clauses in Y , namely, add all left hand sides of (4) together 
reaching an even number (in the right hand side) by virtue of k being even (recall we can 
assume that k is even). Then, count by literals, namely sum all values of literals in Y^ 
under the assignment Z, which we can prove is odd from our axioms. We now describe this 
refutation more formally. 

( s \ 

Since k is even, counting by clauses in Y , namely, adding the left hand sides of (4) gives 
us easily the following (with a polynomial-size R(quad) proof): 

k 

E PjOs + Pjn + Pj2s G {0, 2, 4, .., 3k}. (5) 

3=1 

— ( s ) 

Now we need to count by literals in Y . We can abbreviate the number of occurrences in 
( s \ 

Y of the literal Xi , for i G [2n] , s G [t] , by: 

Ti '■= E Qijh ■ 
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Let us abbreviate by Si the contribution of the literals Xj and —>Xi to the total sum (5). Thus 

Si'= E Q§1'«+ E Q&nyh-i 1 -*)- 

ie[fc] je[fc] 

/i=0,l,2 h=0,l,2 

It is possible to prove the following: 

Ti G {0, 2, 4, k} V 5i G {1, 3, 5, k - 1} (6) 

which states that if the number of occurrences in Y of the literal Xj is odd then (since by 
our axioms stating that every variable occurs even times, the number of occurrences of the 
literal -iscj must also be odd) the contribution of Xi and —>Xi to the total sum (5) is also odd 
(because either z% = or zi = 1). 

By the axioms saying that the number of negative literals is odd (axiom 6) we get that: 

n 

Y,T i+n G {1,3,5,...,* -ra-1}. (7) 

i=l 

— ( s ) 

And from the axioms stating that each variable occurs even times in Y we have: 

Ti + Ti +n G {0, 2, 4, k}, far all i€[n]. (8) 

From (8) we obtain X^i=i ^ {0' 2, 4, k ■ n}, and from this and (7) we obtain 

n 

5^Ti€ {1,3,5, (9) 

t=i 

Note that (6) can be interpreted as saying that if Tj is odd then so does Si. Accordingly, one 
can use (6) to substitute all Ti,...,T n in (9) with Si,...,S n , respectively. We thus get that 
the total sum in the left hand side of (5) is in {1, 3, 5, ...}, and we obtain a contradiction with 
(5). 

From a refutation of the collection of disjunctions (4), for any s G [t], we can actually get 
the negation of this collection, that is: 

V (P i0s + Pju + P j2s ) G {0, 2}. (10) 

i elk] 

This stems from the following: it is already true in resolution that if we have a size 7 resolution 
refutation of A\,...,Ai, then assuming the axioms A\ V B%, Ai V Bi, we can have a size 
0(7 • d) resolution derivation of B\ V ... V Bi, given that the total size of the B^s is d. To 
see this, take the resolution refutation of Ai,..., A\ and OR every line in this refutation with 
B\M . .. V ' Bi (note that the resulting new axioms are actually derivable from the axioms Ai V Bi 
via Weakening) . Now, to get (10) from (4), we do the same, putting Pjo s + Pjis + Pj2s G {1,3} 
instead of Aj and Pjo s + Pj\ s + Pj2 S G {0,2} instead of Bj, for all j G [k], noting that: 

(P j0s + P jls + P j2s G {1, 3}) V (P j0s + Pji, + Pj2s G {0, 2}), for all j G [k\. (11) 
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Step 2: The next step in our R(quad) refutation is showing how to obtain the final contra- 
diction, given the collection of formulas (10), for all s G [t]. This is again by counting: we 
know that for every truth assignment Z, each Y^, 7" must contribute at least one clause 
from X that is unsatisfiable as 3XOR under Z. We can view this as a mapping g : [t] — > [m] 
from yW, to the m clauses in X, such that g(i) = j means that 7" contributes 
the clause j in X that is unsatisfiable under Z as 3XOR. The mapping g is not 1-to-l, but 
d-to-1, because every clause of X can appear at most d times in ...,Y( S \ Our R(quad) 
refutation proceeds as follows. 

By assumption we have YlT=i n « ^ { m ~ fV^l + 1> ■ ■■,m}, meaning that the number of 
clauses in X that are satisfied as 3XOR under the assignment Z is at least m — \t/d~\ + 1. 
Also, by the axioms in our formula, for all i G [m] we can prove that Ui = 1 implies that 
Ui G {1, 3}; namely that the number of true literals in the ith clause of X is 1 or 3. 

For any s£ [t] , we can think of Y as a mapping : [k] -> [m] that maps the fc clauses 
in to the clauses in X. Then, yf) = 1 means that f^ s \i) = j. Thus, u% ■ y^f = 1 means 

that the jth clause in Y is the ith clause in X and that the ith clause in X is satisfiable 
as 3X0 R under Z. 

Now, it is possible to show that for any s G [t], i G [m] and j G [A;], there is a proof of the 
following line: 

(ui ■ = 0) V (P i0a + P ils + P j2s G {1, 3}) (12) 
which states that if the ith clause in X is satisfied as 3XOR under the assignment Z and 

— (s) — — (s) 

the jth clause in Y maps to the ith clause in X, then the jth clause in Y is satisfied as 
3XOR under Z. 

Informally, the proof of (12) is explained as follows: the term Pjo s + Pji s + Pj2 S can be 

— ( s ) 

seen as the addition, denoted S, of all inner products of the jth row of Y with the columns 
of X (for each h = 0, 1, 2 we can consider the column of X restricted to the h ■ i rows only 

(i G [m]), and so a row of Y which is of length m can have an inner product with such a 

column of length m in X). Because we assume that yf) = 1, only the ith coordinate in the 
( s \ 

jth row of Y is 1 (and all the other entries in this row are 0, by our axioms). Thus, S 
equals in fact a single column from X; and this single column is precisely Ui. 
From (12) and (10) we can derive, for any s G [t] and any i G [m]: 

V (vj? • (1 " u i) = l ) . ( 13 ) 

je[k],ie[m] 

stating that for some j G [k], i G [m], the jth clause in Y^ is the ith clause in X and the ith 
clause in X is not satisfied as 3XOR under Z. 

Now, from (13) and axioms 7 in the 3XOR principle formulas, stating that g : [t] — > [m] 
is d-to-1, we can obtain that the number of it,'s that are true is no more than m — \t/d~\ , that 
is, Yliehn] Ui e {0, ...,m — [i/cf]}, contradicting the axiom Y^JLi u j € {m — \t/d~\ +1, ...,m}. 
The formal proofs of this in R(quad) is shown in the following lemma: 

Lemma 8 There are polynomial- size R(quad) refutations of (13) and the axioms in parts 7 
and 9 in the 3XOR principle. 
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je[k], S £[t] 

[ml 



Proof: First sum all axioms (7) to obtain: 

E yfj G{0,l,...,d-m}. (14) 

From (13) we can obtain: 

E yff ■ (I- Ui) e {1,2, ...,k-m}, for every s e [£]. 

ie[fc],re[m] 

And by summing this for all s € [t] and i £ [m], we get: 

E E #■(!-«*)= E (!-«*)■ E # 

iGfml je[*],re[m] iefml je[fc],T-g[m] 

»e[t] »6[t] 

£ {t ■ m,t ■ m+ 1, ...,t ■ k ■ m 2 }. (15) 

From the axiom in part (9) in the 3XOR principle X^=i u i ^ { m ~ [V^l +!)•■•) m } we can 
obtain easily 

E(!-«i) e{o,i,...,\t/d] -i}. 

ie[m] 

From this and (14) we get, via Lemma 9 proved below, the following: 

E(i-0- E yj?e{o,i,...,d.m-(rt/rfl-i)}. 

i£[m] j'6[*],«e[t] 

r £ [m] 

Since d • m • (\t/d~\ — 1) < d ■ m ■ \t/d~\ < m ■ t, we obtain a contradiction with (15), which 
finishes the refutation. ■ 

It remains to prove Lemma 9, which was used in the above proof: 

Lemma 9 Let ^2 ie jXi S {0,1, ...,n} and YljejUj £ {0,l,...,m} 6e disjunctions of linear 
equations, both of size at most s. Given these two disjunctions we can prove in R(quad) with 
a polynomial-size in s proof, the following: 

E Xi ' E y i G {0,l,...,m-n}. (16) 

iei jeJ 

Proof: We can reason in a case-by-case manner as follows (see [RT08] on how to carry out 
informal case-analysis reasoning inside R(lin)): assume that X^eJ%' = a > f° r a ^ {0> 1> •••> m l- 
We wish to show that x\ • YljejVj = ax i- If ^i = then x\ ■ J2j € jyj = = ax\. Otherwise, 
X\ = 1. Then, x\ ■ Ylje J Uj = Sjg j Vi = a = ax i- Since we have the axiom (x\ = 0) V (x\ = 1) 
we conclude that x\ ■ X^/eJ Uj = ax i- ^ n a similar way we can derive for all i £ I: 

Xi-^yj = axi. (17) 
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And by adding (17) for all i G I we obtain: 

iei jeJ iei 
Now using the axiom Yliel Xi e 1, — jfi}, we get 

' 5^%' G {0,a,2a, ...,n- a}. (18) 

iei jeJ 

Recall that (18) was obtained under the assumption that YljeJ = ° - This means that if 
we have the axiom YljejVj e {A 1; m}, we can obtain: 

• yj G {6 • c | 6 G {0, 1, .., n} and c G {0, 1, m}} = {0, 1, n ■ m}. 

iei jeJ 



Note that the proof of Lemma 9 would also work if instead of the sums X^e/ Xi or Sje j Vj 
we have ^2 i&I biXi or ^2 j&J Cjyj, for integers bi,Cj. 



7 Reduction to weak automatizability of R(lin) 

Here we show that R(lin) is weakly automatizable if and only if R(quad) is weakly automa- 
tizable. 

To show that R(lin) is weakly automatizable iff R(quad) is weakly automatizable we use 
a similar idea to Pudlak [Pud03]. Namely, we show that the canonical pair of R(quad) is 
polynomially reducible to the canonical pair of R(lin). 

Definition 6 ([Raz94]) The canonical pair of a refutation system V is the disjoint NP- 
pair, whose first NP language consists of all pairs (r, l m ) where r is an unsatisfiable formula 
that has a V -refutation of size at most m, and whose second NP language is the set of pairs 
(fj,, l m ) where /i is a satisfiable formula and m is some natural number. 

We say that a canonical pair (A, B) of a refutation system V' is polynomially reducible 
to the canonical pair (^4', B') of another refutation system V if there is a polynomial-time 
computable function / such that for all x it holds that x G A <^=> f(x) G A' and x G 
B <J=^ f{x) G B' . A simple corollary of the above definitions is the following: 

Proposition 10 ([Pud03]) If the canonical pair of V is polynomially reducible to the 
canonical pair of V then V' is weakly automatizable if V is weakly automatizable. 

In view of this proposition, and since R(quad) clearly polynomially simulates R(lin) (as 
an extension of it), it remains to show the following: 

Proposition 11 The canonical pair of R(quad) is polynomially reducible to the canonical 
pair o/R(lin). 
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Proof: (Sketch) Similar to [Pud03], the idea is to encode a product of any two variables X{ - Xj 
as a new single formal variable x^j. Thus, the reduction sends all pairs (r, l m ) to the pair 
(r',lP ol y( m )), where t' is obtained from r by adding the axioms that force all new variables 
Xij to encode the product x^ ■ Xj, as shown in Section 5. ■ 

Corollary 12 R(quad) is weakly automatizable iff R(lin) is weakly automatizable. 

Since R(quad) admits polynomial-size refutations of the 3XOR principle, and since weak 
automatizability entails feasible interpolation, we get a reduction of the problem of deter- 
minizing Feige et al. non-deterministic refutation algorithm to the problem of establishing 
weak automatizability of R(lin) : 

Corollary 13 If R(lin) is weakly automatizable then there is a deterministic refutation al- 
gorithm for random 3CNFs with 0(n 1,4 ) clauses. 

8 Conclusions 

We reduced the algorithmic problem of constructing a deterministic refutation algorithm for 
random 3CNFs with n variables and Q(n 1A ) clauses to the (nondeterministic) problem of 
proving the existence of a short proof of a certain tautology in somewhat weak proof systems, 
and also to the problem of establishing weak automatizability of R(lin) . 

We do not believe that there are polynomial-size refutations of the 3XOR principle in 
cutting planes or R(lin), since, loosely speaking, it is impossible in these proof systems to 
compose two mappings, an ability that seems to be inevitable for any proof system that can 
refute efficiently the 3XOR principle. 

It should also be noted that it might be possible to use similar arguments to Atserias and 
Bonet [AB04] to show that R(quad) has feasible interpolation iff R(lin) is weakly automatiz- 
able. 

Finally, it might be possible that the 3XOR principle is already efficiently provable in 
an apparently weaker proof system than R(quad), namely a proof system that operates 
with proof-lines that are disjunctions of constant many generalized quadratic equations (note 
that we did not assume any bound on the number of disjunctions in an R(quad) proof-line). 
This would reduce the problem of determinizing Feige et al. non-deterministic algorithm to 
the weak automatizability of the proof system R°(lin) introduced in [RT08], for which it is 
already known that the feasible interpolation holds by [RT08] (though weak automatizability 
is unknown for it). 
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